Universal Chemical Markup (UCM) - A new format for common chemical data

نویسندگان

  • Jan Mokrý
  • Miloslav Nic
چکیده

Background We wish to introduce a new chemical format called UCM (Universal Chemical Markup). The format is based on XML (Extensible Markup Language) and its first version focuses on recording chemical structures and their properties. Results UCM currently supports structures containing isotopes, ions and various types of bonding including delocalized bonds. Properties can be expressed by combining UCM with UnitsML (Units Markup Language). Using UnitsML one defines quantities with scientific units, and then refers to them in UCM when recording property values. Users can also add literature references with BibTeXML (BibTeX Markup Language) and annotate the recorded data using plain text or XHTML (Extensible Hypertext Markup Language) descriptions. In contrast to presently available general-purpose chemical formats, UCM offers built-in validation, which combines both grammar and pattern-based XML schema languages. Thus, all recorded data can be precisely validated by UCM schemas in standard XML validators. Conclusions We developed the structure for UCM from scratch on the basis of an analysis described in our previous article. Starting from scratch allowed us to integrate BibTeXML, UnitsML and XHTML as well as chemical line notations and identifiers into UCM. It also helped us to avoid unnecessary redundant parts and create the implementation that aims to minimize ambiguity and is designed to be easily extensible in the future.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing Universal Chemical Markup (UCM) through the reusable methodology based on analyzing existing related formats

Background In order to design concepts for a new general-purpose chemical format we analyzed the strengths and weaknesses of current formats for common chemical data. While the new format is discussed more in the next article, here we describe our software tools and two stage analysis procedure that supplied the necessary information for the development. The chemical formats analyzed in both st...

متن کامل

The semantics of Chemical Markup Language (CML) for computational chemistry : CompChem

: This paper introduces a subdomain chemistry format for storing computational chemistry data called CompChem. It has been developed based on the design, concepts and methodologies of Chemical Markup Language (CML) by adding computational chemistry semantics on top of the CML Schema. The format allows a wide range of ab initio quantum chemistry calculations of individual molecules to be stored....

متن کامل

Open Babel: An open chemical toolbox

BACKGROUND A frequent problem in computational modeling is the interconversion of chemical structures between different formats. While standard interchange formats exist (for example, Chemical Markup Language) and de facto standards have arisen (for example, SMILES format), the need to interconvert formats is a continuing problem due to the multitude of different application areas for chemistry...

متن کامل

A New Software Environment for MBS Simulation Based on XML and Integrated with CAD/CAE Packages

This paper presents a new software environment for MBS simulation based on XML (eXtensible Markup Language), the universal format for structured documents and data on the World Wide Web. The core of this environment is MechML, a work-in-progress language to describe MBS data and related information based on XML. MechML takes full advantage of the latest developments in data modeling and offers ...

متن کامل

Ketcher: web-based chemical structure editor

Ketcher is an open-source web-based chemical structure editor developed by SciTouch LLC. Being written on pure JavaScript it incorporates high performance, good portability and light weight. Editor supports late versions of all most popular browsers, such as Internet Explorer, Firefox, Safari, Opera and Chrome. It is developed with minimum third-party code to remain its light weight. It is plat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • PeerJ PrePrints

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2015